Automatic index creation for handwritten notes
نویسندگان
چکیده
This paper describes a technique for automatically creating an index for handwritten notes captured as digital ink. No text recognition is performed. Rather, a dictionary of possible index terms is built by clustering groups of ink strokes corresponding roughly to words. Terms whose distribution varies significantly across note pages are selected for the index. An index page containing the index terms is created, and terms are hyper-linked back to their original location in the notes. Further, index terms occurring in a note page are highlighted to aid browsing.
منابع مشابه
Scale space technique for word segmentation in handwrittenmanuscriptsR
Indexing large archives of historical manuscripts is required to allow rapid perusal by scholars and researchers who wish to consult the original manuscripts. However, automatic conversion of handwritten manuscripts to digital form allowing eecient storage and retrieval of the original documents is a challenging problem. Word spotting is a scheme to index such data. The important steps in this ...
متن کاملOff-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model
In order to facilitate the entry of data into the computer and its digitalization, automatic recognition of printed texts and manuscripts is one of the considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...
متن کاملHandwritten Notes as a Visual Interface to Index, Edit and Publish Audio/Video Highlights
Digital libraries aim to make media-rich information accessible to "anyone, anywhere, anytime." However, digital audio and video are difficult to search and share. This paper describes Souvenir, a system which enables people to use their handwritten or text notes to retrieve and share specific media moments. Souvenir enables users to take time-stamped notes on a variety of devices, such as the ...
متن کاملMedical Informatics – Technological Innovations Medical Records – Data Processing Natural Language Processing Semantics – Data Processing Temporal Event Analysis Text Processing (computer Science) Natural Language Processing and Temporal Information Extraction in Emergency Department Triage Notes
Electronic patient records, including the Emergency Department (ED) Triage Note (TN), provide a rich source of textual information. Processing clinical texts to create important pieces of structured information will be useful to clinicians treating patients, clinicians in training, and researchers and practitioners in biosurveillance. This work applies natural language processing (NLP) and info...
متن کاملEstimating Writing Neatness from Online Handwritten Data
Handwriting is the most fundamental expressive activity in learning. In order to utilize the nature, digital pen technology has been emerged to capture notes and transfer them. We have developed AirTransNote, a student note-sharing system that facilitates collaborative and interactive learning in conventional classrooms. With AirTransNote system, a teacher can immediately share student notes to...
متن کامل